How accurately can 21 cm tomography constrain cosmology? 
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There is growing interest in using 3-dimensional neutral hydrogen mapping with the redshifted 
21 cm line as a cosmological probe. However, its utility depends on many assumptions. To aid 
experimental planning and design, we quantify how the precision with which cosmological parameters 
can be measured depends on a broad range of assumptions, focusing on the 21 cm signal from 6 < z < 
20. We cover assumptions related to modeling of the ionization power spectrum, to the experimental 
specifications like array layout and detector noise, to uncertainties in the reionization history, and 
to the level of contamination from astrophysical foregrounds. We derive simple analytic estimates 
for how various assumptions affect an experiment's sensitivity, and we find that the modeling of 
reionization is the most important, followed by the array layout. We present an accurate yet robust 
method for measuring cosmological parameters that exploits the fact that the ionization power 
spectra are rather smooth functions that can be accurately fit by 7 phenomenological parameters. 
We find that for future experiments, marginalizing over these nuisance parameters may provide 
almost as tight constraints on the cosmology as if 21 cm tomography measured the matter power 
spectrum directly. A future square kilometer array optimized for 21 cm tomography could improve 
the sensitivity to spatial curvature and neutrino masses by up to two orders of magnitude, to 
Aflk ~ 0.0002 and Am„ ~ 0.007 eV, and give a 4ct detection of the spectral index running predicted 
by the simplest inflation models. 



PACS numbers: 98.80.Es, 98.58.Ge 
I. INTRODUCTION 

Three-dimensional mapping of our Universe using the 
redshifted 21 cm hydrogen line has recently emerged as 
a promising cosmological probe, with arguably greater 
long-term potential than the cosmic microwave back- 
ground (CMB). The information garnered about cosmo- 
logical parameters grows with the volume mapped, so the 
ultimate goal for the cosmology community is to map our 
entire horizon volume, the region from which light has 
had time to reach us during the 14 billion years since our 
Big Bang. Figure [1] illustrates that whereas the CMB 
mainly probes a thin shell from z ~ 1000, and current 
large-scale structure probes (like galaxy clustering, grav- 
itational lensing, type la supernovae and the Lyman a 
forest) only map small volume fractions nearby, neutral 
hydrogen tomography is able to map most of our horizon 
volume. 

Several recent studies have forecast the precision with 
which such 21 cm tomography can constrain cosmolog- 
ical parameters, both by mapping diffuse hydrogen be- 
fore and during the reionization epoch [l|, @, El and by 
mapping neutral hydrogen in galactic halos after reion- 




FIG. 1: 21 cm tomography can potentially map most of 
our observable universe (light blue/light grey), whereas the 
CMB probes mainly a thin shell at z ~ 10 3 and current large- 
scale structure surveys (here exemplified by the Sloan Digital 
Sky Survey and its luminous red galaxies) map only small 
volumes near the center. This paper focuses on the convenient 
7 < z < 9 region (dark blue/dark grey). 
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ization [4|. These studies find that constraints based on 
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TABLE I: Factors that affect the cosmological parameter measurement accuracy. 



Assumptions 


Pessimistic 


Middle 


Optimistic 


Power 
modeling 


Ionization power 
spectrum modeling 


Marginalize over arbitrary 
P M o and P M 2 


Marginalize over constants 
that parametrize (A;) and 


No ionization power spec- 
trum, ^ss(k) oc Pat(Ic). 


Non-linear cut-off 

SCale femax 


lMpc -1 


2Mpc _1 


4Mpc _1 


Non-Gaussianity of 
ionization signals 


Doubles sample variance 


Negl 


gible 


Cosmological 


Reionization 
history 


Gradual reionization over wide range of redshifts 


Abrupt reionization at z < 7 


Redshift range 


7.3-8.2 


6.8-8.2 


6.8 - 10 


Parameter space 


Vanilla model plus optional 
parameters 




Vanilla model parameters 


Experimental 


Data 


MWA, LOFAR, 21CMA 


Intermediate case 


SKA, FFTT 


Array configuration 

a 


T) = 0.15 


t? = 0.8, n = 2 


Giant core 


Collecting area 


0.5 x design values 


Design values 


2 x Design values 


Observation time c 


1000 hours 


4000 hours 


16000 hours 


System 
temperature 


2 x T sys in [Hj 


T sys given in [Hj 


0.5 x T sya in [Hj 


Astrophysical 


Residual fore- 
grounds cut-off 

SC£l1g ^min 


4%/yB 


2-K/yB 


n/yB 



"For the FFTT, we consider only the case where all dipoles are in 
a giant core. 

b See designed or assumed values of A e in Table ITVl 

c Assumes observation of two places in the sky. 

d lt is hard to predict the level of the residual foregrounds after the 
removal procedure. To quantify contributions from other factors, 
we take the approximation that there is no residual foregrounds at 
k > fc m i n . Here in the table, yB is the comoving (l.o.s.) distance 
width of a single 2-bin. 



TABLE II: The dependence of cosmological constraints on the full range of assumptions. We assume the fiducial values given in 
Section Til Fl and employ the Fisher matrix formalism to forecast the la accuracy of 21cm tomography measurements. Unless 
otherwise noted, errors are computed by marginalizing over all other parameters in the first ten columns (which we refer to as 
the "vanilla" parameters). In "All OPT/MID/PESS", we use the assumptions of the right, middle and left column of Table U 
respectively. We assume that the total observing time is split between two sky regions, each for an amount in Table U using 
a giant/quasi-giant /small core array configuration where 100%/80%/15% of the antennae in the inner core are compactly laid 
at the array center while the rest 0%/20%/85% of antennae fall off in density as p ~ r~ 2 outside the compact core. 



Vanilla Alone 







Afi A 


Aln(fi m fi z ) 


Al„(S2 h h^) 


Ana 


A In A s 




Ax 


Ai H P ») " 


A2 H (7.5) 


Ax H (8.0) 


A5;h(9-2) 


Ari k 


Am v [eV] 


Ac 


Planck 




0.0070 


0.0081 


0.0050 


0.0033 


0.0088 





on (3 










0.025 


0.23 


0.0026 




All OPT 


0.0044 


0.0052 


0.0051 


0.0018 


0.0087 





0042 


0.0063 


0.0063 


0.0063 


0.0063 


0.0022 


0.023 


0.00073 


+LOFAR 


All MID 


0.0070 


0.0081 


0.0059 


0.0032 


0.0088 





0043 


0.18 


0.26 


0.23 




0.018 


0.22 


0.0026 




All PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 





0043 




51 


49 




0.025 


0.23 


0.0026 




All OPT 


0.0063 


0.0074 


0.0055 


0.0024 


0.0087 





0043 


0.0062 


0.0062 


0.0062 


0.0062 


0.0056 


0.017 


0.00054 


4-MWA 


All MID 


0.0061 


0.0070 


0.0056 


0.0030 


0.0087 





0043 


0.32 


0.22 


0.29 




0.021 


0.19 


0.0026 




All PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 





0043 




29 


30 




0.025 


0.23 


0.0026 




All OPT 


0.00052 


0.0018 


0.0040 


0.00030 


0.0087 





0042 


0.0059 


0.0059 


0.0059 


0.0059 


0.0011 


0.010 


0.00027 


+ SKA 


All MID 


0.0036 


0.0040 


0.0044 


0.0025 


0.0087 





0043 


0.0094 


0.014 


0.011 




0.0039 


0.056 


0.0022 




All PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 





0043 




1.1 


1.0 




0.025 


0.23 


0.0026 




All OPT 


0.00010 


0.0010 


0.0029 


0.000088 


0.0086 





0042 


0.0051 


0.0051 


0.0051 


0.0051 


0.00020 


0.0018 


0.000054 


4-FFTT 6 


All MID 


0.00038 


0.00034 


0.00059 


0.00033 


0.0086 





0042 


0.0013 


0.0022 


0.0031 




0.00023 


0.0066 


0.00017 




All PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 





0043 




0.0043 


0.0047 




0.025 


0.11 


0.0024 



a xn(z) refers to the mean neutral fraction at redshift z. 

6 FFTT stands for Fast Fourier Transform Telescope, a future 
square kilometer array optimized for 21 cm tomography as de- 
scribed in Dipoles in FFTT are all in a giant core, and this 
configuration does not vary. 
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the cosmic microwave background measurements can be 
significantly improved. However, all of these papers make 
various assumptions, and it is important to quantify to 
what extent their forecasts depend on these assumptions. 
This issue is timely because 21 cm experiments (like LO- 
FAR 0, 21 CM A [|, MWA [| and SKA 0) are still 
largely in their planning, design or construction phases. 
These experiments will be described in detail in Section 
III Gl In order to maximize their scientific "bang for the 
buck" , it is therefore important to quantify how various 
design tradeoffs affect their sensitivity to cosmological 
parameters. 

The reason that neutral hydrogen allows mapping in 
three rather than two dimensions is that the redshift of 
the 21 cm line provides the radial coordinate along the 
line-of-sight (l.o.s.). This signal can be observed from the 
so-called dark ages before any stars had formed, 

through the epoch of reionization (EoR), and even to the 
current epoch (where most of the neutral hydrogen is 
confined within galaxies). We focus in this study on the 
21 cm signal from 6 < z < 20 - the end of the dark ages 
through the EoR. This is the redshift range at which the 
synchrotron foregrounds are smallest, and consequently 
is the range most assessable for all planned 21 cm arrays. 

There are three position-dependent quantities that im- 
print signatures on the 21 cm signal: the hydrogen den- 
sity, the neutral fraction, and the spin temperature. 
For cosmological parameter measurements, only the first 
quantity is of interest, and the last two are nuisances. 
(For some astronomical questions, the situation is re- 
versed.) The 21 cm spin- flip transition of neutral hy- 
drogen can be observed in the form of either an absorp- 
tion line or an emission line against the CMB blackbody 
spectrum, depending on whether the spin temperature is 
lower or higher than the CMB temperature. 

During the epoch of reionization, the spin temperature 
is likely coupled to the gas temperature through Lya pho- 
tons via the Wouthuysen-Field Effect [12, El, and the gas 
in the inter-galactic medium (IGM) has been heated by 
X-ray photons to hundreds of Kelvin from the first stars 
(l4j |. If this is true, the 21cm signal will only depend on 
the hydrogen density and the neutral fraction. However, 
astrophysical uncertainties prevent a precise prediction 
for exactly when the gas is heated to well above the CMB 
temperature and is coupled to the spin temperature. In 
this paper, we follow [H, 0] and focus entirely on the 
regime when the spin temperature is much larger than 
the CMB temperature [l7|, El, HH , such that the observed 
signal depends only on fluctuations in density and/or the 
neutral fraction. Specifically, we focus on the time inter- 
val from when this approximation becomes valid (around 
the beginning of the reionization E3, EM Gj|) until most 
hydrogen has become ionized, illustrated by the darkest 
region in Figure [TJ Despite this simplification, the meth- 
ods that we apply to model the ionization fluctuations al- 
most certainly can be applied to model spin temperature 
fluctuations with minimal additional free parameters. 

In Table HI we list all the assumptions that affect the 



accuracy of cosmological parameter measurements, in- 
cluding ones about power modeling, cosmology, experi- 
mental design, and astrophysical foregrounds. For each 
case, we provide three categories of assumptions: one 
pessimistic (PESS), one middle-of-the-road (MID) and 
one optimistic (OPT). Since we wish to span the entire 
range of uncertainties, we have made both the PESS and 
OPT models rather extreme. The MID model is intended 
to be fairly realistic, but somewhat on the conservative 
(pessimistic) side. 

Before describing these assumptions in detail in the 
next section, it is important to note that taken together, 
they make a huge difference. Table |TT] illustrates this by 
showing the cosmological parameter constraints result- 
ing from using all the OPT assumptions, all the MID 
assumptions or all the PESS assumptions, respectively. 
For example, combining CMB data from Planck and 21 
cm data from FFTT, the la uncertainty differs by a fac- 
tor of 125 for J7k and by a factor of 61 for m v depending 
on assumptions. It is therefore important to sort out 
which of the assumptions contribute the most to these 
big discrepancies, and which assumptions do not matter 
much. This is a key goal of our paper. 

The rest of this paper is organized as follows. In Sec- 
tion [HJ we explain in detail the assumptions in the same 
order as in Table Q] and also present a new method for 
modeling the ionization power spectra. In Section IIII1 
we quantify how the cosmological parameter measure- 
ment accuracy depends on each assumption, and we de- 
rive simple analytic approximations of these relations. In 
Section HVl we conclude with a discussion of the relative 
importance of these assumptions, and implications for 
experimental design. 

II. FORECASTING METHODS & 
ASSUMPTIONS 

A. Fundamentals of 21cm cosmology 

1. Power spectrum of 21 cm radiation 

We review the basics of the 21 cm radiation tempera- 
ture and power spectrum only briefly here, and refer the 
interested reader to [2fj| for a more comprehensive discus- 
sion of the relevant physics. The difference between the 
observed 21 cm brightness temperature at the redshiftcd 
frequency v and the CMB temperature Tcmb is [HJ 

T , s _ 3c 3 feAi n g (x)[T s (x) - T C mb] , . 

b[X) ~ 327[* B i#r s (x)(l + z) 2 (dv {] /dr) ' ( ' 

where T$ is the spin temperature, nu is the number 
density of the neutral hydrogen gas, and A\q w 2.85 x 
10 _15 s -1 is the spontaneous decay rate of 21cm transi- 
tion. The factor dvn/dr is the gradient of the physical ve- 
locity along the line of sight (r is the comoving distance) , 
which is H{z)/(\ + z) on average (i.e. for no peculiar ve- 
locity). Here H(z) is the Hubble parameter at redshift 
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z. The spatially averaged brightness temperature at red- 
shift z is (in units of mK) 



T b w 23.885 H 



Tqub \ ( £lbh 2 



0.02 



0.15 1+j 

n m h 2 10 



1/2 



where ih is the mean neutral fraction, and Tg is the 
averaged spin temperature. If T$ S> Tcmb in the EoR, 
the 21cm emission should therefore be observed at the 
level of milli-Kelvins. 

To calculate the fluctuations, we rewrite Eq. ^ in 
terms of S (the hydrogen mass density fluctuation), S x 
(the fluctuation in the ionized fraction), and the gradi- 
ent of the peculiar velocity dv r /dr along the line of sight, 
using the fact that dv\\/dr = H(z)/(l + z) + dv r /dr: 



T 6 (x) =T h [l- + 5 X )} (1 + S) ( 1 - 

■ Ha or 



' F i MB \ 



(3) 



Here xi = 1— xh is the mean ionized fraction, and we have 
defined f b = f b /x R x [f s /(f s - T CMB )}. We write S v = 
(Ha)~ 1 dv r /dr. In Fourier space, it is straightforward to 
show that, as long as S <C 1 so that linear perturbation 
theory is valid, <5 u (k) = —fi 2 5, where /J, = k • n is the 
cosine of the angle between the Fourier vector k and the 
line of sight. In this paper, we restrict our attention to 
the linear regime. We will also throughout this paper 
assume Ts S> Tqmb during the EoR, making the last 
factor in Eq. ([3]) unity for the reasons detailed in Section 

m 

In Fourier space, the power spectrum PAr(k) of the 
21cm fluctuations is defined by (AT h *(k)AT b (k')) = 
(27r) 3 <5 3 (k— k')PAr(k), where AT b is the deviation from 
the mean brightness temperature. It is straightforward 
to show from Eq. ([3]) that, to leading order, 

^AT(k) = f 2 {[x&P M - 2x H P xS + Pcx] 

+2n 2 [x 2 i P ss - x H P xS ] + ^x^Pss} ■ (4) 

Here P xx = xfPg^g^ and P x $ = XiPg x s are the ionization 
power spectrum and the density-ionization power spec- 
trum respectively. For convenience, we define ^ss(k) = 
TjxlPssik), & xS (k) = f 2 x H P xS (k) and &> xx {k) = 
T b 2 P xx {k), so the total 21 cm power spectrum can be writ- 
ten as three terms with different angular dependence: 



PAT(k) = P^o{k)+P^{k)^ 2 + P^(k)^ 



where 



P^> 

P,A 



2{^ SS - & xS ), 



(5) 



(6) 
(7) 
(8) 



Since P^* involves only the matter power spectrum that 
depends only on cosmology, Barkana and Loeb [HI, [24| 



argued that in principle, one can separate cosmology 
from astrophysical "contaminants" such as 3? xx and & x g 
whose physics is hitherto far from known. We will quan- 
tify the accuracy of this conservative approach (which 
corresponds to our PESS scenario for ionization power 
spectrum modeling below) in Section [Ml 



2. From u to k 

The power spectrum P\T(k) and the comoving vec- 
tor k (the Fourier dual of the comoving position vector 
r) are not directly measured by 21cm experiments. An 
experiment cannot directly determine which position vec- 
tor r a signal is coming from, but instead which vector 



= x c x -\- OyCy -\- Afc z it is coming from, where {0 x ,0y) 
give the angular location on the sky plane, and Af is the 
frequency difference from the central redshift of a z-bin. 
For simplicity, we assume that the sky volume observed is 
small enough that we can linearize the relation between 
and r. Specifically, we assume that the sky patch ob- 
served is much less than a radian across, so that we can 
approximate the sky as flat 1 , and that separations in fre- 
quency near the mean redshift are approximately pro- 
portional to separations in comoving distance. In these 
approximations, if there are no peculiar velocities, 



A/ 



(9) 
(10) 



Here "_L" denotes the vector component perpendicular 
to the line of sight, i.e., in the (x, j/)-plane, and d a is the 
comoving angular diameter distance given by [43| 



d A (z) = -£r\(l k \-V 2 S 

Ho 



\Q k \ 1/2 



dz' 



o E(z') 



(11) 



where 



E(z) = = v/r» m (l + z) 3 + r! k (l + z)2 + fi A , (12) 

-Ho 

is the relative cosmic expansion rate and the function 
S(x) equals sin(x) if f2 K < 0, x if ilk = 0, and sinhx if 
fik > 0. The conversion factor between comoving dis- 
tances intervals and frequency intervals is 



y( z ) = 



A 2 i(l + z) ; 
H E(z) 



(13) 



1 The FFTT is designed for all-sky mapping (i.e. the field of view 
is of order 2tt). However, since the angular scales from which we 
get essentially all our cosmological information are much smaller 
than a radian (with most information being on arcminute scales), 
the flat-sky approximation is accurate as long as the data is an- 
alyzed separately in many small patches and the constraints are 
subsequently combined. 
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where A21 
21 cm line. 



21 cm is the rest-frame wavelength of the 



B. Assumptions about 



and 



We write the Fourier dual of as u : 



u x e x 



Une z i u \\ nas units of time), 
k is therefore 



The relation between u and 



U_L 

'"■II 



yh ■ 



(14) 
(15) 



In u-space, the power spectrum Pat(u) of 21cm signals is 
defined by (AT b *(u)AT fe (u')) = (27t) 3 <5( 3 Hu-u')Pat(u), 
and is therefore related to Pat (k) by 



Pat(u) 



1 



— P AT(k) . 



(16) 



Note that cosmological parameters affect Pat( u ) m 
two ways: they both change PatW and alter the ge- 
ometric projection from k-space to u-space. If cIa and 
y changed while PAr(k) remained fixed, the observable 
power spectrum Pat(u) would be dilated in both the 
uj^ and un directions and rescaled in amplitude, while 
retaining its shape. Since both cLa and y depend on the 
three parameters (ilk, ^A, h), and the Hubble parameter 
is in turn given by the parameters in Table [TT] via the 
identity h = ^/f2 m /i 2 /(l — tt\ — ilk), we see that these 
geometric effects provide information only about our pa- 
rameters (f2k, f^A, ^mh 2 ). Baryon acoustic oscillations in 
the power spectrum provide a powerful "standard ruler" , 
and the equations above show that if one generalizes to 
the dark energy to make Q\ an arbitrary function of z, 
then the cosmic expansion history H(z) can be measured 
separately at each redshift bin, as explored in 0, d, @ . 
21 cm tomography information on our other cosmologi- 
cal parameters (n s , A a , flbh 2 , m v , a, etc.) thus comes 
only from their direct effect on PAr(k). Also note that 
(f2k,^A) affect P\r(k) only by modulating the rate of 
linear perturbation growth, so they alter only the ampli- 
tude and not the shape of PAr(k). 

If we were to use Eq. (IT61 to infer PatO*) from the 
measured power spectrum Pat(u) while assuming incor- 
rect cosmological parameter values, then this geometric 
scaling would cause the inferred PAr(k) to be distorted 
by the so-called Alcock-Paczynski (AP) effect [H, l27j 
and not take the simple form of Eqns.ljn])-©. To avoid 
this complication, we therefore perform our Fisher matrix 
analysis directly in terms of Pat (u) , since this quantity 
is directly measurable without any cosmological assump- 
tions. 

The above transformations between u-space and r- 
space are valid when there are no peculiar velocities. The 
radial peculiar velocities v r that are present in the real 
world induce the familiar redshift space distortions that 
were discussed in Section [II A 11 causing /i 2 and fi 4 power 
spectrum anisotropics that were described there. 



During the EoR, ionized bubbles (HII regions) in the 
IGM grow and eventually merge with one another. Con- 
sequently, ^xxCk) and ^ x a(k) contribute significantly 
to the total 21cm power spectrum. The study of the 
forms of these two ionization power spectra has made 
rapid progress recently, particularly through the semi- 
analytical calculations [l?! [l!| [U and radiative 
transfer simulations (28l. l5l|. However, these models de- 
pend on theoretically presumed parameters whose values 
cannot currently be calculated from first principles. From 
the experimental point of view, it is therefore important 
to develop data analysis methods that depend only on 
the most generic features of the ionization power spec- 
tra. In this paper, we consider three methods — our 
OPT, MID and PESS models — that model 3? xx and 
3^x5 as follows: 



(OPT) 



(MID) 



^XX (^) 

>x S (k) 



(17) 



i&{k) 



(PESS) 



& 2 X [1 + a xx (fc P xx ) + (k P xx ) 2 ] ~ & 
bl & exp [-a xS (kR xS ) - {k R xS ) 2 ] &'~ 



(fid) 



(fid) 

66 



^xx(k) — arbitrary 
^xs(k) — arbitrary 



(18) 



(19) 



In the next three subsections, we explain these models in 
turn. 



1. OPT model 

It is likely that before reionization started (while ~ 
1 and & xx = £P x s = 0), hydrogen gas had already been 
sufficiently heated that Ts ^ Pcmb- In this regime, 
Eq. (fTT|) holds. This OPT scenario is clearly the sim- 
plest model, since the total 21cm power spectrum is sim- 
ply proportional to @> S5 : Pat (k) = ^ss{k){\ + [i 2 ) 2 . To 
forecast the la error, we use the Fisher matrix formalism 
}25j. Repeating the derivation in [44], the Fisher matrix 
for cosmological parameters A a (a — 1, . . . , N p ) is 



<91nP^° T (u) 



d\nP£ T (u) 



Ve- 



'(27T) 3 ' 

(20) 

where the integral is taken over the observed part of u- 
space, and Pat( u ) denotes the combined power spectrum 
from cosmological signal and all forms of noise. Here 
Vq = ft x B is the volume of the ©-space where f2 is the 
solid angle within the field of view (f.o.v.) and B is the 
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TABLE III: Fiducial values of ionization parameters adopted 
for Figure -R xx and R^s are in units of Mpc, while other 
parameters are unitless. 
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fin 




7xx 


b 2 * 


R*s 


"x<5 


9.2 


0.9 


0.208 


1.24 


-1.63 


0.38 


0.45 


0.56 


-0.4 


8.0 


0.7 


2.12 


1.63 


-0.1 


1.35 


1.47 


0.62 


0.46 


7.5 


0.5 


9.9 


1.3 


1.6 


2.3 


3.1 


0.58 


2. 


7.0 


0.3 


77. 


3.0 


4.5 


2.05 


8.2 


0.143 


28. 



frequency size of a z-bin. The Fisher matrix determines 
the parameter errors as AA a = (F _1 ) aa . 

For computational convenience, we subdivide u-space 
into pixels so small that the power spectrum is roughly 
constant in each one, obtaining 



E 



1 f dP AT (u) 
[^at(u)] 2 ^ d\ a 



( dP AT (u) 



(. 21 ) 

where the power spectrum measurement error in a pixel 
at u is 



<5Pat(u) = 



AT 



(u) _ Pat(u) + Pn(u±) 



N, 



1/2 



N, 



1/2 



(22) 



Here Pn{u±) is the noise power spectrum and will be 
discussed in detail in Section III G 31 note that it is in- 
dependent of uii and depends only on uj_ through the 
baseline distribution of the antenna array. 

N c = 2nk 2 sin(9AfcA6> x Vo1/(2tt) 3 (23) 

is the number of independent cells in an annulus summing 
over the azimuthal angle. We have the factor yJ\/N c in 
SPat instead of the normal yj2/N c because we only sum 
over half the sphere. 



2. MID model 

After reionization starts, both ionization power spectra 
& xx and £P x s make significant contribution to the total 
21cm power spectrum. We explore two different analysis 
methods — our MID and PESS models — for separating 
the cosmological signal from these astrophysical contam- 
inants (i.e. 8* X x and £^ x s)- 

Our MID model assumes that both ionization power 
spectra £P xx (k) and £P x s(k) are smooth functions of 
k which can be parametrized by a small number of 
nuisance parameters f3\,..., /? nion related to reionization 
physics. Combining these ionization parameters with 
our cosmological ones A a into a larger parameter set p a 
(a = 1, . . . , N p + riion), we can jointly constrain them by 
measuring Pat(u). 

In Appendix [2] we will describe a \ 2 goodness-of-fit 
test for quantifying whether this parametrization is valid. 
The Fisher matrix for measuring p a is simply 



1 



dP AT {u) dP AT (u) 



pixels 



[SP AT (u)]i d Po 



dp/3 



This Fisher matrix F a p is not block diagonal, i.e., there 
are correlations between the cosmological and ionization 
parameters, reflecting the fact that both affect 8? xx (k) 
and 8 > x s(k). The inversion of the Fisher matrix therefore 
leads to the degradation of the constraints of cosmologi- 
cal parameters. However, the total 21cm power spectrum 
is usually smaller in magnitude in the MID model than in 
the OPT model (see Eq. Q), giving less sample variance. 
This means that as long as noise in a 21cm experiment 
dominates over sample variance, the MID model will give 
weaker constraints than the OPT model, because of the 
degeneracies. For future experiments with very low noise, 
however, it is possible to have the opposite situation, if 
the reduction in sample variance dominates over the in- 
crease in degeneracy. This does of course not mean that 
the MID model is more optimistic than the OPT model, 
merely that the OPT model is assuming an unrealistic 
power spectrum. 

Having set up the general formalism, we now propose a 
parametrization specified by Eq. (|18[) . with fiducial val- 
ues of ionization parameters given in Table IIIII This 
parametrization was designed to match the results of 
the radiative transfer simulations in Model I of [28[ , and 
Figure [2] shows that the fit is rather good in the range 
k = 0.1 — 2 Mpc -1 to which the 21cm experiments we 
consider are most sensitive. 

The radiative transfer simulations implemented in [28j 
are post processed on top of a 1024 3 N-body simulation in 
a box of size 186 Mpc. Three models for the reionization 
history are considered in [28j |: 

1. In Model I, all dark matter halos above m coo i (cor- 
responding to the minimum mass galaxy in which 
the gas can cool by atomic transitions and form 
stars, e.g. m coo i ~ 10 8 M at z = 8) contribute 
ionizing photons at a rate that is proportional to 
their mass. 

2. In Model II, the ionizing luminosity of the sources 
scales as halo mass to the 5/3 power, i.e. more mas- 
sive halos dominate the production of ionizing pho- 
tons than in Model I. 

3. In Model III, which has the same source 
parametrization as in Model I except for doubled 
luminosity, minihalos with m > 1O 5 M0 absorb in- 
cident ionizing photons out to their virial radius 
unless they are photo-evaporated (but do not con- 
tribute ionizing photons). 

It appears to be a generic feature in the simula- 
tion results that the ratios of functions at large k fall 
off like a power law for 3^ xx {k)/ '&$$(k), and exponen- 
tially for & x5 {k)/& 55 {k). At small k, 0> xx {k)/ 0> ss (k) 
can either increase or decrease approximately linearly 
as k increases, while 3^ x s{k) / 3^ss{k) is asymptoti- 
cally constant. Our parametrization in Eq. (|18[) cap- 
tures these features: at large k, £? xx (k)/ ^ss{k) oc 
(24) k-i™ and &> xS {k)/ & ss (k) oc exp (-(jfe R xS ) 2 ); at 
small k, £ g xx (k)/& l 88(k) oc (1 - (7 XX a xx i? xx /2) k), and 
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FIG. 2: Fits to the ionization power spectra at several redshifts. Solid (blue) lines are the results of the radiative transfer 
simulation in Model I of the McQuinn et al. paper [28| ]. Dashed (green) lines are fitting curves of our parametrization. Dot- 
dashed (red) lines are best fits using the parametrization suggested by Santos and Cooray [3] • Top panels: &xx/ 3?5& = 
Pxx/(5h-P««)- Bottom panels: ^s/^ss = Pxs/(xhP ss ). From left to right: z = 9.2, 8.0, 7.5,' 7.0 (z; = 0.10, 0.30, 0.50, 0.70 
respectively) . 



^xs(k)/ ' @>ss{k) oc (1 - a x s RxS k) (both a xx and a x s 
can be either positive or negative). Figure [2] also shows 
that for P xx (fc) and also for P x s{k) at large k, our 
parametrization further improves over the parametriza- 
tion P(k)/P$s — b 2 e~( kR ) suggested by Santos and 
Cooray Q, which works well for P x s(k) at small k. 

To be conservative, we discard cosmological informa- 
tion from & x s{k) and ^ X x{k) in our Fisher matrix anal- 
ysis by using the fiducial power spectrum ^ssi^r 
rather than the actual one ^ss (k) in Eq. |T5J. This 
means that the derivatives of ^ x s(k) and ^ X x{k) with 
respect to the cosmological parameters vanish in Eq. (fM| . 
It is likely that we can do better in the future: once the 
relation between the ionization power spectra and the 
matter power spectrum can be reliably calculated either 
analytically or numerically, the ionization power spectra 
can contribute to further constraining cosmology. 

In addition to the fit of Model I shown in Figure 
we also fit our model (with different fiducial values from 
those listed in Table UlT]) to the simulations using Model II 
and III in [28] , and find that the parametrization is flex- 
ible enough to provide good fits to all three simulations, 
suggesting that the parametrization in Eq. (|18p may be 
generically valid and independent of models. Note, how- 
ever, that at low redshifts (xi > 0.7), our parametrization 
of ^xs/^ss does not work well at large k, in that the 



simulation falls off less rapidly than exponentially. This 
may be because when HII regions dominate the IGM, the 
ionized bubbles overlap in complicated patterns and cor- 
relate extremely non-linearly at small scales. This partial 
incompatibility indicates that our parametrization (i.e. 
Eg frS)) is only accurate for small X{, i.e. before non-linear 
ionization patterns come into play. 

In the remainder of this paper, we will adopt the values 
in Table IIIII as fiducial values of the ionization parame- 
ters. 



3. PESS model 

By parametrizing the ionization power spectra with 
a small number of constants, the MID model rests on 
our understanding of the physics of reionization. From 
the point of view of a maximally cautious experimen- 
talist, however, constraints on cosmological parameters 
should not depend on how well one models reionization. 
In this spirit, Barkana and Loeb [23l . l24j proposed what 
we adopt as our "PESS" model for separating the physics 
3P$s{k) from the "gastrophysics" 0* X x(k) and & x &(k). 
Instead of assuming a specific parametrization, the PESS 
model makes no a priori assumptions about the ioniza- 
tion power spectra. In each fc-bin that contains more 
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than three pixels in u-space, one can in principle sepa- 
rate P„4(fc) = ^ss(k) from the other two moments. The 
PESS model essentially only constrains cosmology from 
the term and therefore loses all information in P^o 
and P^-i . We now set up the Fisher matrix formalism for 
the PESS model that takes advantage of the anisotropy 
in PAr(k) arising from the velocity field effect. Numeri- 
cal evaluations will be performed in Section [ill Al 

The observable in 21cm tomography is the brightness 
temperature T;,(x). In Fourier space, the covariance ma- 
trix between two pixels k; and kj is Cjj = <Jy [Pat (kj ) + 
PnQzx)], assuming that the measurements in two differ- 
ent pixels are uncorrelated 2 . The total 21cm power spec- 
trum is Pat (k) = P^(k) + P^{k)^i 2 + P^ik)^. For 
convenience, we use the shorthand notation Pa, where 
Pi = P„a, P2 = P^2 and P3 = P^4 and define the a a = 
0,2,4 for A = 1,2,3, respectively. Thus the power spec- 
trum can be rewritten as Pat = J2a=i ^ A ^ aA ■ Treat- 
ing Pa(/c) at each fc-bin as parameters, the derivatives of 
the covariance matrix are simply dCij/dPA(k) — Sijfi aA , 
where |kj| resides in the shell of radius k with width Ak. 
Since the different fc-bins all decouple, the Fisher matrix 
for measuring the moments Pa (A;) is simply a separate 
3 x 3-matrix for each k-b'm: 



F AA >(k) 



1 

= x ti- 



de 



dC 



dP A {k) dP A >(k) 

fi a A +a A , 

2^ WpT^ruva ' 



upper half-shell 



(25) 



where 5P AT {^) = N~ 1 [Pvr(k) + PvO_l)]. Here 
Pjv(fcj_) is related to Pn(u±) by Eq. (fT6| . Again the 
sum is over the upper half of the spherical shell k < |k| < 

k+Ak. The la error of P 3 = P^ is SP 3 (k) -- 



F-\ 3 (k). 



Once ^ss = P^ is separated from other moments, 
can be used to constrain cosmological parameters A a with 
the Fisher matrix as given in Eq. (|2"Tj) . 

We have hitherto discussed the anisotropy in Pat (k) 
that arises from the velocity field effect. However, the 
AP-cffect may further contribute to the anisotropy in 
that it creates a /^-dependence and modifies the /z 4 term 
[26l The AP-effect can be distinguished from the 

velocity field effect since the P^e term is unique to the 
AP-cffect. Thus, one can constrain cosmological param- 
eters from P^4 and P^s [2j], involving the inversion of a 
4x4 matrix which loses even more information and there- 
fore further weakens constraints. Therefore, the PESS 
Fisher matrix that we have derived without taking AP- 
effect into account can be viewed as an upper bound on 
how well the PESS approach can do in terms of cos- 
mological parameter constraints. However, this maxi- 
mally conservative 4x4 matrix approach may be inap- 
propriately pessimistic, since the AP-induced clustering 



anisotropy is typically very small within the observation- 
ally allowed cosmological parameter range, whereas the 
velocity-induced anisotropies can be of order unity. 



C. Assumptions about Linearity 

To avoid fitting to modes where 5k is non-linear and 
physical modeling is less reliable, we impose a sharp cut- 
off at fc max and exclude all information for k > fc max . 
We take fc max = 2Mpc _1 for our MID model, and in- 
vestigate the fc max -dcpendence of cosmological parameter 
constraints in Section lill Bl 



D. Assumptions about non-Gaussianity 

Non-Gaussianity of ionization signals generically be- 
comes important at high X{. Using cosmic reionization 
simulations with a large volume and high resolution, Lidz 
et al. H§| and Santos et al. [4(| found non-negligible (a 
factor of 1.5) differences in the full power spectrum at 
high x x (x\ > 0.35)). To get a rough sense of the impact of 
non-Gaussianity on cosmological parameter constraints, 
we simply model it as increasing the sample variance by 
a factor £. We thus write the total power spectrum as 



6P AT (u) = N- 1 ' 2 [^Pat(u) + P N (u±)} 



(26) 



2 We ignore here a <5-function centered at the origin since 21cm 
experiments will not measure any k = modes. 



where £ is the factor by which the the sample variance is 
increased. The parameter £ should take the value £ w 1 
(Gaussian) at epochs with low 5; and 1 < £ < 2 (non- 
Gaussian) at high x\. 



E. Assumptions about reionization history and 
redshift range 

21cm tomography can probe a wide range of redshifts, 
as illustrated in Figure [TJ However, one clearly cannot 
simply measure a single power spectrum for the entire 
volume, as the clustering evolves with cosmic time: The 
matter power spectrum changes gradually due to the lin- 
ear growth of perturbations [37[. More importantly, the 
ionization power spectra vary dramatically with redshift 
through the epoch of reionization. We incorporate these 
complications by performing our analysis separately in a 
series of redshift slices, each chosen to be narrow enough 
that the matter and ionization power spectra can be 
approximated as constant in redshift within each slice. 
This dictates that for a given assumed reionization his- 
tory, thinner redshift slices must be used around redshifts 
where 5h varies dramatically. 

In this paper, we will consider two rather opposite toy 
models in Section HTT1 

• OPT: A sharp reionization that begins and finishes 
at one redshift (say z < 7). 
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TABLE IV: Specifications for 21cm interferometers 



Experiment 


Want 


Min. base- 


f.o.v. 


A e (m 2 ) at 






line (m) 


(deg 2 ) 


z=6/8/12° 


MWA 


500 


4 


7rl6^ 


9/14/18 


SKA 


7000 


10 


tt8.6 2 


30/50/104 


LOFAR 


77 


100 


2 X tt2.4 2 


397/656/1369 


FFTT 


10 6 


1 


2tt 


1/1/1 



"We assume that the effective collecting area is proportional to 
A 2 such that the sensitivity (A e /T syB in m 2 K — 1 ) meets the design 
specification. 



• MID/PESS: A gradual reionization that spans a 
range of redshifts, assuming the ionization param- 
eter values that fit Model I simulation of the Mc- 
Quinn et al. paper [28[ 



For the latter scenario, the ionization fraction xn is not 
a linear function of redshift. For example, in in the Mc- 
Quinn et al. [28[ simulation, xh =0.9, 0.7, 0.5 and 0.3 
correspond to redshifts z = 9.2, 8.0, 7.5 and 7.0, respec- 
tively. For our different scenarios, we therefore adopt 
the redshift ranges 6.8 < z < 10 that are divided into 
four redshift slices centered at the above redshifts (OPT), 
6.8 < z < 8.2 split into three bins centered at z=7.0, 7.5 
and 8.0 (MID), 7.3 < z < 8.2 split into two slices centered 
at z = 7.5 and 8.0. 



F. Assumptions about cosmological parameter 
space 

Since the impact of the choice of cosmological param- 
eter space and related degeneracies has been extensively 
studied in the literature, we will perform only a basic 
analysis of this here. We work within the context of 
standard inflationary cosmology with adiabatic pertur- 
bations, and parametrize cosmological models in terms 
of 12 parameters (see, e.g. , Table 2 in (4f| for explicit 
definitions) whose fiducial values are assumed as follows: 
ilk = (spatial curvature), fl\ = 0.7 (dark energy den- 
sity), £!b = 0.046 (baryon density), h = 0.7 (Hubble pa- 
rameter Ho = 100/ikms -1 Mpc -1 ), r = 0.1 (reionization 
optical depth), Q v — 0.0175 (massive neutrino density), 
n s = 0.95 (scalar spectral index), A s — 0.83 (scalar fluc- 
tuation amplitude), r — (tensor-to-scalar ratio), a = 
(running of spectral index), n t — (tensor spectral in- 
dex) and w = — 1 (dark energy equation of state) . We will 
frequently use the term "vanilla" to refer to the minimal 
model space parametrized by (SIa, ^mh 2 , flbh 2 , n s , A s , t) 
combined with xh(z) and ionization parameters at all 
observed z-bins, setting f2k, Qvh 2 , r, a, n t , and w fixed at 
their fiducial values. 



G. Assumptions about Data 

The MWA, LOFAR, SKA and FFTT instruments are 
still in their planning/design/development stages. In 
this paper, we adopt the key design parameters from 
for MWA, [10] and www.skatelescope.org for SKA, 
www.lofar.org for LOFAR, and [ll| for FFTT unless ex- 
plicitly stated. 



1. Interferometers 

We assume that MWA will have 500 correlated 4m x 4m 
antenna tiles, each with 16 dipoles. Each individual tile 
will have an effective collecting area of 14 m 2 at z = 8 and 
18 m 2 at z > 12. LOFAR will have 77 large (diameter 
~ 100 m ) stations, each with thousands of dipole anten- 
nae such that it has the collecting area nearly 50 times 
larger than each antenna tile of MWA. Each station can 
simultaneously image N regions in the sky. We set N = 2 
in this paper but this number may be larger for the real 
array. The design of SKA has not been finalized. We 
assume the "smaller antennae" version of SKA, in which 
SKA will have 7000 small antennae, much like MWA, 
but each panel with much larger collecting area. FFTT 
stands for Fast Fourier Transform Telescope, a future 
square kilometer array optimized for 21 cm tomography 
as described in [111 ]. Unlike the other interferometers we 
consider, which add in phase the dipoles in each panel 
or station, FFTT correlates all of its dipoles, resulting 
in more information. We evaluate the case where FFTT 
contains a million lm x lm dipole antennae in a con- 
tiguous core subtending a square kilometer, providing a 
field-of-view of 2-7T steradians. 

For all interferometers, we assume that the collect- 
ing area A e oc A 2 , like a simple dipole, except that A e 
is saturated at z ~ 12 in MWA since the wavelength 
A = 21(1 + 2) cm exceeds the physical radius of an MWA 
antenna panel. The summary of the detailed specifica- 
tions adopted in this paper is listed in Table ITVl 



2. Configuration 

The planned configurations of the above-mentioned in- 
terferometers are quite varied. However, all involve some 
combination of the following elements, which we will ex- 
plore in our calculations: 

1. A nucleus of radius Rq within which the area cov- 
erage fraction is close to 100%. 

2. A core extending from radius Rq our to i?i n where 
there coverage density drops like some power law 

«.— n 

3. An annulus extending from R ln to i? ut where the 
coverage density is low but rather uniform. 
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In its currently planned design, the MWA will have a 
small nucleus, while the core density falls off as r~ 2 until 
a sharp cutoff at R{ a . For LOFAR we assume 32 sta- 
tions in the core, and another 32 stations in an outer 
annulus out to radius i? ut ~ 6 km. For SKA we as- 
sume 20% in the core, and 30% in the annulus out to 
radius i? ut ~ 5 km. We ignore the measurements from 
any dilute distribution of antenna panels outside R ou t- 
For LOFAR and SKA, we assume a uniform distribu- 
tion of antennae in the annulus, but with an inner core 
profile like that of the MWA, i.e., a nucleus of radius 
Rq = 285/189 m (LOFAR/SKA) and an r~ 2 fall-off out- 
side this compact core. We assume an azimuthally sym- 
metric distribution of baselines in all arrays. 

For an array with iVj n antennae within R in , we can 
define a quantity 



(27) 



where po is the area density of the nucleus. i?™ ax is the 
maximal radius of the nucleus, corresponding to the case 
where there it contains all the N[ n antennae and there is 
no core. 

It is also convenient to parametrize the distribution 
of these iV; n antennae within i?j n by two numbers: the 
fraction r\ that are in the nucleus, and the fall-off index 
n of the core. It is straightforward to show that Rq and 
i?i„ are related to 77 and n by 




R = s /rjR [ 



max 
: 



(28) 




MWA fixing R 




10' 


10 s 




"1 ''ill 

\ : 




R oul : 

i 


I SKA fixing R,„ 
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10 s 


1 111 
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r R ° \ 

\ 
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1 : 

1 : 
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FIG. 3: Examples of array configuration changes. For MWA 
(upper panels), antennae are uniformly distributed inside the 
nucleus radius -Ro, and the density p falls off like a power 
law for Rq < r < Ri n where Ri n is the core radius. For 
SKA (lower panels) and similarly for LOFAR, there is in ad- 
dition a uniform yet dilute distribution of antennae in the 
annulus R in < r < R ou t, where R ou t is the outer annulus 
radius. When 7? is decreased (Ro = 0.7/0.5/0.3 x i?g lax ) 
with Rin = 3.0 x i?™ ax fixed (left panels), the density in the 
core falls off slower (blue/red/green curves). When Ri n is de- 
creased (R in = 4.0/3.0/2.0 x Rff 3 *) with R = 0.5 x R^ 
fixed (right panels), the density in the core also falls off less 
steep (dashed/solid/dotted curves). 



if n 7^ 2. The analytic relation for n = 2 is R[ n = 
Ro exp [(1 — 77)/ (277)], which can be well approximated in 
numerical calculation by by taking n = 2 + e in Eq. (|29|) 
with e - 10~ 10 . 

In Section UlI El we will scan almost all possible design 
configurations and find the optimal one for constraining 
cosmology. There are two independent ways to vary array 
configurations, as illustrated by Figure [3] by varying R 
with Ri„ fixed, and by varying Ri n with Rq fixed. Con- 
tributions from antennae in the annulus are negligible 
compared to the core, so varying R ou t is not interesting. 

In other parts of Section IIII1 we will assume the in- 
termediate configuration 77 = 0.8 and n = 2 (except 
for FFTT which is purely in a giant core) with the 
planned number of antennae in the core and annulus. 
Note that this configuration is optimized from the cur- 
rently planned design. 



3. Detector noise 

21cm radio interferometers measure visibility V. The 
visibility for a pair of antennae is defined as [3l| 

VK%,A/) = J dxdyAT b (x,y,Af)e- i( - uM «y\ 

(30) 

where (u x ,u y ) are the number of wavelengths be- 
tween the antennae. The hydrogen 3D map is the 
Fourier transform in the frequency direction I(u) = 
J dAf\(u x ,u y ,Af)exp(—iAfm\) where u = u x e x + 
e z . The detector noise covariance matrix for 
an interferometer is 0, HH 

c » K , U)) = (^) 2 i (31) 

where B is the frequency bin size, T sys is system temper- 
ature, and t u « (A e to/ X 2 )n(u±) is the observation time 
for the visibility at |uj_| = c?a |k| sin 9. Here 7j is the total 
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observation time, and n is the number of baselines in an 
observing cell. 

The covariance matrix of the 21cm signal I(u) is re- 
lated to the power spectrum PatO<) by [2| 

c w ( Ul ,u,) = (r(u,)/( Ui )) 

= PaAu^^-S^ . (32) 
Therefore, the noise in the power spectrum is 

P n {u_l) = ( 2 ^— . (33) 

V ' \ A e ) t n(u ± ) y ' 

For all interferometers, the system temperature is dom- 
inated by sky temperature T s k y ~ 60(A/1 m) 2 55 K due 
to synchrotron radiation in reasonably clean parts of the 
sky. Following (29[, we set T sys = 440 if at z = 8 and 
T svs = 690 if at z = 10. 



H. Assumptions about Residual Foregrounds 

There have been a number of papers discussing fore- 
ground removal for 21 cm tomography (e.g. [H, [H, l35l . 
l36j and references therein), and much work remains to 
be done on this important subject, as the the ampli- 
tudes of residual foregrounds depend strongly depends 
on cleaning techniques and assumptions, and can have 
potentially dominate the cosmological signal. The work 
of Wang et al. [33[ and McQuinn et al. 0] suggested 
that after fitting out a low-order polynomial from the 
frequency dependence in each pixel, the residual fore- 
grounds were negligible for k > 2n/yB where yB is the 
comoving width of a z-bin. To obtain a crude indica- 
tion of the impact of residual foregrounds, there there- 
fore we adopt the rough approximation that all data be- 
low some cutoff value £; m j n is destroyed by foregrounds 
while the remainder has negligible contamination. We 
choose fc mi „ = (1/2/4) x %/yB for the OPT/MID/PESS 
scenarios, and also explore wider ranges below. 



III. RESULTS AND DISCUSSION 

In this section, we numerically evaluate how the ac- 
curacy of cosmological parameter constraints depend on 
the various assumptions listed above. Where possible, 
we attempt to provide intuition for these dependences 
with simple analytical approximations. In most cases, 
we explore the dependence on one assumption at a time 
by evaluating the PESS, MID and OPT scenario for this 
assumption while keeping all other assumptions fixed to 
the baseline MID scenario. 



A. Varying ionization power spectrum modeling 
and reionization histories 

1. Basic results 

We start by testing assumptions in the ionization 
power modeling of P xx and P x s ■ In Table [V] we show 
the accuracy with which the 21cm power spectrum can 
place constraints on the cosmological parameters from 
three 2-bins ranging from z = 6.8 — 8.2. We fix the as- 
sumptions concerning fc max , the foreground removal, and 
the array layout and specifications, but vary the sophis- 
tication with which we model the ionization power. 

Our results agree with those of previous studies 
0, i.e. 21cm data alone (except for the opti- 
mized FFTT) cannot place constraints comparable with 
those from Planck CMB data. However, if 21cm 
data are combined with CMB data, the parameter 
degeneracies can be broken, yielding stringent con- 
straints on f2k, m v and a. For example, in the 
OPT model, from LOFAR/MWA/SKA/FFTT combined 
with Planck, the curvature density f2k can be mea- 
sured 5/3/15/78 times better, to a precision A£\ = 
0.005/0.008/0.002/0.0003, the neutrino mass m u can be 
constrained 4/9/14/74 times better to accuracy Am„ = 
0.06/0.03/0.02/0.003, and running of the scalar spec- 
tral index a can be done 1/2/4/28 times better, to 
Aa = 0.002/0.001/0.0006/0.0001. The more realistic 
MID model yields weaker yet still impressive constraints: 
from SKA/FFTT combined with Planck, f\ can be mea- 
sured 6/109 times better, to Ail k = 0.004/0.0002, m v 
4/35 times better, to Am„ = 0.06/0.007, and a 1/15 
times better, to Aa = 0.002/0.0002. The improved mea- 
surements of fik and a enable further precision tests 
of inflation, since generically fik is predicted to van- 
ish down to the 10 -5 level, while the simplest inflation 
models (with a single slow-rolling scalar field) predict 
a ~ (1 — n s ) 2 ~ 10~ 3 . For example, the inflaton poten- 
tial V{4>) cx (j) 2 predicts a w -0.0007, while V((j)) oc (j) 4 
predicts a = 0.008. In addition, 21cm data combined 
with CMB data from Planck can make accurate mea- 
surements in the mean neutral fraction xn(z) at separate 
redshifts, outlining the full path of reionization, e.g. at 
the Ax K (z) - 0.01/0.003 level from SKA/FFTT data 
combined with Planck data. 



2. OPT and MID models 

For most 21cm experiments, the OPT model yields 
stronger constraints than the MID model. The reason 
is as follows. By assuming P xx = P^s — 0, there are 
essentially no neutral fraction fluctuations in the OPT 
model. This means that this model is an ideal model 
in which the 21cm power spectrum encodes cosmological 
information per se, since Pat(^) oc (k) at each pixel 
in the Fourier space. In the more realistic MID model, 
however, the nuisance ionization parameters has correla- 
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TABLE V: How cosmological constraints depend on the ionization power spectrum modeling and reionization history. We 
assume observations of 4000 hours on two places in the sky in the range of z = 6.8 — 8.2 that is divided into three z-bins 
centered at z — 7.0, 7.5 and 8.0 respectively, fc max = 2Mpc _1 , k m in = 2-iv/yB and a quasi-giant core configuration (except for 
FFTT that is a giant core), la errors of ionization parameters in the MID model, marginalized over other vanilla parameters, 
are listed separately in Table IVTl 



Vanilla Alone 





Model 


ASIa 


Aln(« m /i a ) A \n(U b h' A ) 


An 


A In A s 


At 




Ax H (7.5) Ax H (8.0) 


ACk 




Arv 


LOFAR 


OPT 


0.025 


0.27 


0.44 


0.063 


0.89 










0.14 


0.87 


0.027 




MID 


0.13 


U.Uoo 


U.lo 


0.36 


0.80 










0.35 


12 


0.17 


MWA 


OPT 




0.11 


0.19 


u.uzz 


U.O 1 










U.UOO 


U.OO 


U.Ulo 




MID 


0.22 


0.017 


0.029 


0.097 


0.76 










0.13 


9.6 


0.074 


SKA 


OPT 


n nrto o 

U.UUoo 


U.U44 


U.Uoo 


U.U079 


0.16 










0.023 


0.12 


U.UU4U 




MID 


n m a 

U.U14t 


0.0049 


0.0081 


n nio 


U.Uo / 










U.U4o 


U.ou 


U.UUDU 




OPT 


0.00015 


0.0032 


0.0083 


("1 l"\<"ll"\ 1 1"\ 

0.00040 


0.015 










0. 00098 


0.011 


0.00034 


FFTT 


MID 


0.00U41 


0.00038 


0.00062 


0.00036 


0.0013 










0.0037 


0.0078 


0.00017 




PESS 


1.1 


0.017 


0.037 


0.010 


0.19 












0.20 


0.0058 


Planck 




0.0070 


0.0081 


0.0059 


0.0033 


0.0088 


0.0043 








0.025 


0.23 


0.0026 




OPT 


0.0066 


0.0077 


0.0058 


0.0031 


0.0088 


0.0043 


0.0077 


0.0084 


0.0093 


0.0051 


0.060 


0.0022 


+LOFAR MID 


0.0070 


0.0081 


0.0059 


0.0032 


0.0088 


0.0043 


0.18 


0.26 


0.23 


0.018 


0.22 


0.0026 




PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 


0.0043 


0.54 


0.31 


0.24 


0.025 


0.23 


0.0026 




OPT 


0.0067 


0.0079 


0.0057 


0.0031 


0.0088 


0.0043 


0.0065 


0.0067 


0.0069 


0.0079 


0.027 


0.0014 


+MWA 


MID 


0.0061 


0.0070 


0.0056 


0.0030 


0.0087 


0.0043 


0.32 


0.22 


0.29 


0.021 


0.19 


0.0026 




PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 


0.0043 


3.8 


0.87 


0.53 


0.025 


0.23 


0.0026 




OPT 


0.0031 


0.0038 


0.0046 


0.0013 


0.0087 


0.0042 


0.0060 


0.0060 


0.0060 


0.0017 


0.017 


0.00064 


+SKA 


MID 


0.0036 


0.0040 


0.0044 


0.0025 


0.0087 


0.0043 


0.0094 


0.014 


0.011 


0.0039 


0.056 


0.0022 




PESS 


0.0070 


0.0081 


0.0059 


0.0033 


0.0088 


0.0043 


0.061 


0.024 


0.012 


0.025 


0.21 


0.0026 




OPT 


0.00015 


0.0015 


0.0036 


0.00021 


0.0087 


0.0042 


0.0056 


0.0056 


0.0056 


0.00032 


0.0031 


0.000094 


+FFTT 


MID 


0.00038 


0.00034 


0.00059 


0.00033 


0.0086 


0.0042 


0.0013 


0.0022 


0.0031 


0.00023 


0.0066 


0.00017 




PESS 


0.0055 


0.0064 


0.0051 


0.0030 


0.0087 


0.0043 


0.0024 


0.0029 


0.0040 


0.025 


0.020 


0.0010 



a xn(z) denotes the mean neutral fraction at the central redshift 
z. xn(z)'s and A B are completely degenerate from the 21cm mea- 
surement alone. For this reason, the errors shown for lnA s from 
21cm data alone is really not marginalized over xn(z)'s. 



TABLE VI: la marginalized errors for the ionization param- 
eters in the MID model. Assumptions are made the same 
as in Table [V] i? X x and R x s are in units of Mpc and other 
parameters are unitless. 



2 




Ab 2 

XX 


Ai? xx 


A« xx 


A7 XX 


Ab 2 x 


AR*s 






Values 


77. 


3.0 


4.5 


2.05 


8.2 


0.143 


28. 




LOFAR 


94 


140 


130 


27 


5.1 


49 


9600 


7.0 


MWA 


20 


43 


43 


8.3 


2.6 


16 


3200 




SKA 


9.1 


9.8 


8.7 


2.0 


0.49 


2.6 


520 




FFTT 


0.59 


0.47 


0.39 


0.098 


0.027 


0.088 


17 




Values 


9.9 


1.3 


1.6 


2.3 


3.1 


0.58 


2. 




LOFAR 


2.2 


55 


18 


73 


1.4 


5.7 


24 


7.5 


MWA 


4.3 


16 


4.9 


22 


1.8 


1.8 


8.1 




SKA 


0.18 


1.7 


0.71 


2.1 


0.076 


0.17 


0.78 




FFTT 


0.0072 


0.027 


0.015 


0.030 


0.0023 


0.0021 


0.012 




Values 


2.12 


1.63 


-0.1 


1.35 


1.47 


0.62 


0.46 




LOFAR 


1.6 


20 


2.1 


34 


1.2 


3.4 


6.9 


8.0 


MWA 


2.7 


13 


4.2 


24 


1.5 


1.6 


2.8 




SKA 


0.085 


0.60 


0.090 


0.90 


0.057 


0.095 


0.24 




FFTT 


0.0017 


0.013 


0.0026 


0.017 


0.0013 


0.0014 


0.0030 



tions with cosmological parameters. Mathematically, the 
inversion of a correlated matrix multiplies each error by 
a degradation factor. 

An exception is the FFTT, where the situation is re- 
versed. As mentioned in Section III B 21 the sample vari- 
ance Pat in the MID model is smaller than that in the 



OPT model because of two reasons: (i) the MID model 
assumes non-zero Pxx and P x s, and P x $ has negative con- 
tribution to the total power spectrum (see Eqs[H]and[7]); 
(ii) the OPT model assumes xh = 1, but xh takes realis- 
tic values (less than 1) in the MID model, decreasing the 
overall amplitude. In a signal-dominated experiment, re- 
duced sample variance can be more important than the 
degradation from correlations. 



3. PESS model 

Our results show that even combined with CMB data 
from Planck, the 21cm data using the PESS model can- 
not significantly improve constraints. There are two rea- 
sons for this failure. Firstly, the PESS model essentially 
uses only P^&^k) to constrain cosmology, by marginaliz- 
ing over P M o and P M 2 . This loses a great deal of cosmo- 
logical information in the contaminated P„o and P^ , in 
contrast to the situation in the OPT and MID models. 
Secondly, to effectively separate P fl i(k) from other two 
moments, the available Fourier pixels should span a large 
range in //. Figure |4] shows that in MWA and FFTT, the 
data set is a thin cylinder instead of a sphere. The limita- 
tion in /i-range will give large degradation factors during 
the inversion of Fisher matrix. (In the limit that there is 
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-2-101-2-101 



k ± [Mpc- 1 ] 

FIG. 4: Available (k±,k\\) pixels from MWA (upper left), 
FFTT (upper right), LOFAR (lower left) and SKA (lower 
right), evaluated at z = 8. The blue/grey regions can be 
measured with good signal-to-noise from the nucleus and core 
of an array, while the cyan/light-grey regions are measured 
only with the annulus and have so poor signal-to-noise that 
they hardly contribute to cosmological parameter constraints. 



only one /x for each shell, then the Fisher matrix is sin- 
gular and the degradation factor is infinite.) These two 
factors work together with the noise level to shrink the 
useful fc-modes into a rather narrow range: as shown in 
Figure U A@> S6 < ^S6 only for k = 0.09 - 0.4 Mpc -1 in 
SKA, k = 0.07 - 1 Mpc -1 in FFTT and over zero modes 
in LOFAR and MWA. 



B. Varying fc max 

We test how varying /e max affects constraints in this 
section. The cutoff fc max depends on the scale at which 
non-linear physics, e.g. the non-linear clustering of den- 
sity perturbations or the irregularities of ionized bub- 
bles, enter the power spectrum. It is illustrated in 
the right panel of Figure [6] that generically cosmologi- 
cal constraints asymptotically approach a value as A; max 
increases above ~ 2 Mpc" 1 (this typical scale can be 
larger for cosmic variance limited experiments such as 
FFTT). Not much cosmological information is garnered 
from these high-/c modes because detector noise becomes 
increasingly important with k. The upshot is that the 
accuracy only weakly depends on fc max . 



*5 
< 




ktMpc- 1 ] 

FIG. 5: Relative lcr error for measuring &gs(k) with the 
PESS model by observing a 6MHz band that is centered at 
z = 8 with MWA (red/solid), LOFAR (blue/short-dashed), 
SKA (green/dotted) and FFTT (cyan/long-dashed). The 
step size is A In A; w 0.10. 



C. Varying the non-Gaussianity parameter £ 

Table (Til shows the effect of changing the non- 
Gaussianity parameter £ in Section III Dl from the £ = 1 
(Gaussian) case to £ = 2 in the PESS scenario, along 
with changing other assumptions. However, there is no 
need to perform extensive numerical investigation of the 
the impact of £, since it is readily estimated analyti- 
cally. Because la error Api in cosmological parameters 
is (F~ 1 )a, it follows directly from Eq. [26] that Ap does 
not appreciably depend on £ for noise dominated exper- 
iments like MWA and LOFAR, whereas Ap oc £ CT with 
er < 1 for (nearly) signal dominated experiments like 
SKA and FFTT. Compared with the other effects that we 
discuss in this section, this (no more than linear) depen- 
dence on the non-Gaussianity parameter £ is not among 
the most important factors. 



D. Varying redshift ranges 

We now test how accuracies depend on the redshift 
ranges. In Table EED (OPT model) and IvTlTI (MID 
model), we consider the optimistic/middlc/pcssimistic 
ranges, z = 6.8-10/ 6.8-8.2 / 7.3-8.2 which is divided 
by n z — A/3/2 z-bins. The results show that, from 21cm 
data alone, the constraints from the extreme ranges dif- 
fer significantly (a factor of 5 for Af2k). Therefore, the 
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TABLE VII: How cosmological constraints depend on the redshift range in OPT model. Same assumptions as in Table FVl but 
for different redshift ranges and assume only OPT model. 



Vanilla Alone 





z range 


Afi A 


Aln{U m h") 


Aln(tt b /i z ) 


An, 


A In A s 


At 


Ax H (7.0) 


Ax H (7.5) 


A2 H (8.0) 


A2 H (9.2) 


A!! k 


Am„ [cV] 


Aa 




6.8-10 


0.021 


0.20 


0.34 


0.049 


0.67 












0.086 


0.75 


0.023 


LOFAR 


6.8-8.2 


0.025 


0.27 


0.44 


0.063 


0.89 












0.14 


0.87 


0.027 




7.3-8.2 


0.036 


0.38 


0.61 


0.090 


1.2 












0.24 


1.3 


0.038 




6.8-10 


0.037 


0.072 


0.14 


0.016 


0.25 












0.031 


0.31 


0.011 


MWA 


6.8-8.2 


0.046 


0.11 


0.19 


0.022 


0.37 












0.056 


0.38 


0.013 




7.3-8.2 


0.070 


0.15 


0.27 


0.032 


0.51 












0.097 


0.53 


0.018 




6.8-10 


0.0032 


0.031 


0.061 


0.0058 


0.12 












0.012 


0.096 


0.0032 


SKA 


6.8-8.2 


0.0038 


0.044 


0.083 


0.0079 


0.16 












0.023 


0.12 


0.0040 




7.3-8.2 


0.0053 


0.059 


0.11 


0.011 


0.21 












0.042 


0.17 


0.0054 




6.8-10 


0.00012 


0.0023 


0.0058 


0.00030 


0.011 












0.00045 


0.0073 


0.00023 


FFTT 


6.8-8.2 


0.00015 


0.0032 


0.0083 


0.00040 


0.015 












0.00098 


0.011 


0.00034 




7.3-8.2 


0.00021 


0.0042 


0.011 


0.00052 


0.019 












0.0021 


0.014 


0.00043 


Planck 




0.0070 


0.0081 


0.0059 


0.0033 


0.0088 


0.0043 










0.025 


0.23 


0.0026 




6.8-10 


0.0065 


0.0076 


0.0057 


0.0031 


0.0088 


0.0043 


0.0077 


0.0084 


0.0082 


0.0090 


0.0046 


0.051 


0.0021 


+LOFAR 


6.8-8.2 


0.0066 


0.0077 


0.0058 


0.0031 


0.0088 


0.0043 


0.0077 


0.0084 


0.0093 




0.0051 


0.060 


0.0022 




7.3-8.2 


0.0068 


0.0079 


0.0058 


0.0032 


0.0088 


0.0043 




0.0085 


0.0093 




0.0072 


0.081 


0.0024 




6.8-10 


0.0065 


0.0076 


0.0056 


0.0031 


0.0088 


0.0043 


0.0065 


0.0067 


0.0066 


0.0067 


0.0066 


0.023 


0.0013 


+MWA 


6.8-8.2 


0.0067 


0.0079 


0.0057 


0.0031 


0.0088 


0.0043 


0.0065 


0.0067 


0.0069 




0.0079 


0.027 


0.0014 




7.3-8.2 


0.0068 


0.0080 


0.0058 


0.0032 


0.0088 


0.0043 




0.0067 


0.0069 




0.011 


0.036 


0.0017 




6.8-10 


0.0027 


0.0035 


0.0045 


0.0012 


0.0087 


0.0042 


0.0060 


0.0060 


0.0060 


0.0060 


0.0016 


0.015 


0.00061 


+SKA 


6.8-8.2 


0.0031 


0.0038 


0.0046 


0.0013 


0.0087 


0.0042 


0.0060 


0.0060 


0.0060 




0.0017 


0.017 


0.00064 




7.3-8.2 


0.0039 


0.0047 


0.0049 


0.0017 


0.0087 


0.0042 




0.0060 


0.0060 




0.0020 


0.019 


0.00075 




6.8-10 


0.00013 


0.0014 


0.0033 


0.00019 


0.0087 


0.0042 


0.0054 


0.0054 


0.0054 


0.0054 


0.00026 


0.0025 


0.000078 


+FFTT 


6.8-8.2 


0.00015 


0.0015 


0.0036 


0.00021 


0.0087 


0.0042 


0.0056 


0.0056 


0.0056 




0.00032 


0.0031 


0.000094 




7.3-8.2 


0.00020 


0.0016 


0.0038 


0.00023 


0.0087 


0.0042 




0.0057 


0.0057 




0.00040 


0.0038 


0.00011 



TABLE VIII: How cosmological constraints depend on the redshift range in MID model. Same assumptions as in Table IVl but 
for different redshift ranges and assume only MID model. 



Vanilla Alone 





z range 


AS1 A 


Aln(tt m /i z ) 


Aln(fl b /i z ) 


An s 


AlnA a 


At 


A2 H (7.0) 


A2 H (7.5) 


AS H (8.0) 


A2 H (9.2) 


AO k 


Am, [cV] 


Aa 




6.8-10 


0.090 


0.055 


0.093 


0.18 


0.43 












0.22 


5.7 


0.080 


LOFAR 


6.8-8.2 


0.13 


0.083 


0.15 


0.36 


0.80 












0.35 


12 


0.17 




7.3-8.2 


0.21 


0.099 


0.15 


0.42 


0.81 












0.62 


15 


0.18 




6.8-10 


0.15 


0.012 


0.020 


0.031 


0.46 












0.092 


4.4 


0.025 


MWA 


6.8-8.2 


0.22 


0.017 


0.029 


0.097 


0.76 












0.13 


9.6 


0.074 




7.3-8.2 


0.40 


0.018 


0.030 


0.099 


1.0 












0.32 


18 


0.083 




6.8-10 


0.010 


0.0031 


0.0056 


0.0073 


0.023 












0.031 


0.23 


0.0032 


SKA 


6.8-8.2 


0.014 


0.0049 


0.0081 


0.012 


0.037 












0.043 


0.36 


0.0060 




7.3-8.2 


0.018 


0.0050 


0.0081 


0.013 


0.039 












0.072 


0.41 


0.0063 




6.8-10 


0.00029 


0.00021 


0.00043 


0.00025 


0.00097 












0.0020 


0.0055 


0.00011 


FFTT 


6.8-8.2 


0.00041 


0.00038 


0.00062 


0.00036 


0.0013 












0.0037 


0.0078 


0.00017 




7.3-8.2 


0.00050 


0.00039 


0.00062 


0.00037 


0.0013 












0.0058 


0.0083 


0.00018 


Planck 




0.0070 


0.0081 


0.0059 


0.0033 


0.0088 


0.0043 










0.025 


0.23 


0.0026 




6.8-10 


0.0069 


0.0080 


0.0058 


0.0032 


0.0088 


0.0043 


0.18 


0.26 


0.15 


0.23 


0.017 


0.22 


0.0026 


+LOFAR 


6.8-8.2 


0.0070 


0.0081 


0.0059 


0.0032 


0.0088 


0.0043 


0.18 


0.26 


0.23 




0.018 


0.22 


0.0026 




7.3-8.2 


0.0070 


0.0081 


0.0059 


0.0032 


0.0088 


0.0043 




0.27 


0.23 




0.023 


0.22 


0.0026 




6.8-10 


0.0056 


0.0065 


0.0054 


0.0029 


0.0087 


0.0043 


0.32 


0.22 


0.091 


0.36 


0.020 


0.11 


0.0025 


+MWA 


6.8-8.2 


0.0061 


0.0070 


0.0056 


0.0030 


0.0087 


0.0043 


0.32 


0.22 


0.29 




0.021 


0.19 


0.0026 




7.3-8.2 


0.0061 


0.0071 


0.0056 


0.0030 


0.0087 


0.0043 




0.25 


0.29 




0.024 


0.19 


0.0026 




6.8-10 


0.0025 


0.0027 


0.0038 


0.0023 


0.0087 


0.0042 


0.0094 


0.014 


0.0075 


0.024 


0.0032 


0.033 


0.0020 


+SKA 


6.8-8.2 


0.0036 


0.0040 


0.0044 


0.0025 


0.0087 


0.0043 


0.0094 


0.014 


0.011 




0.0039 


0.056 


0.0022 




7.3-8.2 


0.0036 


0.0041 


0.0044 


0.0025 


0.0087 


0.0043 




0.015 


0.011 




0.0053 


0.056 


0.0023 




6.8-10 


0.00033 


0.00021 


0.00043 


0.00024 


0.0086 


0.0042 


0.0013 


0.0022 


0.0030 


0.0040 


0.00020 


0.0052 


0.00011 


+FFTT 


6.8-8.2 


0.00038 


0.00034 


0.00059 


0.00033 


0.0086 


0.0042 


0.0013 


0.0022 


0.0031 




0.00023 


0.0066 


0.00017 




7.3-8.2 


0.00041 


0.00035 


0.00059 


0.00033 


0.0086 


0.0042 




0.0022 


0.0031 




0.00024 


0.0070 


0.00017 



sensitivity of a 21cm telescope depends strongly on the 
frequency range over which it can observe the signal. 



E. Optimal configuration: varying array layout 

In this section we first investigate how array layout af- 
fects the sensitivity to cosmological parameters. Next, 
we investigate the optimal array configuration for fixed 
antennae number. Our parametrization of the array con- 



figuration is discussed in Section Hi G 21 

We map the constraint in m v on the Ro~R m plane in 
Figure [7| (OPT model) and Figure [8] (MID model). R Q 
is the radius of the compact core, and i?j n the radius 
of inner core, both in the unit of i? nax = y/N- in / poir. 
Note that if Rq = i? nax , then R in = Rff ax — this is the 
case of a "giant core", in which all antennae are com- 
pactly laid down with a physical covering fraction close 
to unity, and is represented by the x-axis in the Ro-R ln 
plane (the value of Rq is meaningless if R- m = R™ ax ). In 
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TABLE IX: Optimal configuration for various 21cm interferometer arrays. Same assumptions as in Table IVl but for different 
array layout. -Rfn° P ' s ^ ne previously proposed inner core radius, rj is the ratio of the number of antennae in the nucleus to the 
total number inside the core, n is the fall-off index by which p oc r~ n outside the nucleus. 





Experiment 


Rg lax (m) 




R in (xR^) 


R?™ p (m) ° 


V 


n 


Comments 




LOFAR 


319 


0.84 


1.28 


1000 


0.71 


6.0 


Almost a giant core 


OPT 


MWA 


50 


0.64 


2.41 


750 


0.41 


3.0 


Close to a giant core 




SKA 


211 


0.30 


1.56 


1000 


0.09 


0.83 


Almost a giant core 




LOFAR 


319 


0.84 


1.28 


1000 


0.71 


6.0 


Almost a giant core 


MID 


MWA 


50 


0.45 


10 


750 


0.20 


2.3 


Both a large nucleus and a wide-spread core 




SKA 


211 


0.68 


1.57 


1000 


0.46 


2.9 


Almost a giant core 



"Note that in LOFAR and SKA there is an annulus with the outer 
radius 6 km and 5 km respectively. So for them i?j n is not the size 
of total array. 




io-" L- ' — 1 — — — 1 1 — 1 — 1 — 1 — 

l l 

k mln k max [Mpc-i] 

FIG. 6: How cosmological constraints An a depend on fe m i n 
(left panel) and fc ma x (right panel) in the MID model with 
the 21cm experiments MWA (red/solid), LOFAR (blue/short- 
dashed), SKA (green/dotted) and FFTT (cyan/long-dashed). 
We plot An s in this example because it has the strongest 
dependence on fc m in and fe max of all cosmological parameters. 
The quantity 2ir/yB varies in redshift, so as the horizontal 
axis of the left panel, we use the overall scale « m in = fc m m x 
(yB /2n) which is equal for all z-bins, 



Table IIX| we list the optimal configuration that is indi- 
cated by Figure [7] and [5J The compactness of an array is 
represented by i?i n /i?™ ax , since i?g lax is the minimum of 
R ln . In comparison, Ro/R™ ax does not indicate the com- 
pactness, since a slow fall-off configuration with a small 
i?o is effectively very close to a giant core. Rather, i? is 
a transition point from a flat compact core to the fall-off 
region. Note that we have three configuration parame- 
ters Ro, i?i n and i? ou t- We find the annulus for SKA and 
LOFAR to make almost no difference to the cosmological 



constraints, and therefore focus on how to optimize only 
the remaining two parameters Rq and i?j n . 

Table HXl shows that the optimal layout for OPT model 
is close to a giant core, with the inner core much smaller 
than the previously proposed. For MID model, LO- 
FAR and SKA still favors the quasi-giant-core layout, 
but MWA favors a large core whose radius is about the 
size that was previously proposed. The accuracies in m v 
varies in the OPT model by a factor of 3 for LOFAR, 
1.4-1.5 for MWA and SKA, and in the MID model by a 
factor of 3 for LOFAR, 1.3 for MWA and 2.2 for SKA. 
This means that an optimal configuration can improve 
the constraints by a factor up to 3 in noise dominated 
experiments, and up to 2 in signal dominated experi- 
ments. 

The plots have three interesting features. First, the 
configuration of a quasi-giant core is gcncrically favored. 
The reason for this is that the noise on the tempera- 
ture in an observing cell with u± is inversely propor- 
tional to the square root of the number of baselines that 
probe this u±. A compact array increases the number 
of baselines that probe small u±, reducing the overall 
noise level on these modes. Second, a couple of the up- 
coming 21 cm experiments favor the configuration that 
is close but not identical to a giant core. The reason 
for this is because arrays become sample variance lim- 
ited once they have a certain number of baselines that 
probe a given u±. A simple estimate on the signal-to- 
noise ratio for a compact MWA shows that on average 
^ss/Pn ~ 5 at the k ~ 0.1 Mpc -1 and ^ss/Pn « 1/40 
at the k ~ 0.7 Mpc -1 . Although moving more antennae 
to the center can increase the signal-to-noise, the error 
cannot be reduced as much if modes are already domi- 
nated by signal. Third, in the MID model, MWA favors 
a less compact core. This fact is due to the mixing be- 
tween cosmological and ionization parameters. Remem- 
ber that the off-diagonal elements in the Fisher matrix 
are proportional to the magnitude of ionization power 
spectra — the smaller the magnitude, the smaller degra- 
dation factor and the more accurate is the cosmological 
parameter measurement. Figure [5] illustrates that the 
ionization power spectrum generically falls off at large k 
such that a relatively large core, which is more sensitive 
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I: 





FIG. 8: Same as Figure[f]but for MID model. Figures are for LOFAR(left panel), MWA (middle panel) and SKA(right panel). 



to these large k, may actually improve parameter con- 
straints. This factor appears to be important for MWA 
because, as Figure 2] shows, a compactified MWA only 
occupies a rather narrow band in fc-space. This means 
that MWA has to expand significantly in order to use 
much more large k modes. 



It came to our attention that Lidz et al. [38| performed 
an analysis of the optimal configuration for MWA. Lidz 
et al. [38| concludes that the optimal layout for MWA 
is a giant core. This conclusion is slight different than 
ours; we find a compact but not exactly a giant core is 
optimal for MWA. The work in |38| defines the optimal 
configuration to be the configuration that maximizes the 
total signal-to-noise, while our definition is based on pa- 
rameter constraints. In addition, the conclusion in [381 ] 
is based on the comparison of a giant core array config- 
uration to one without a giant core, while we investigate 
a range of plausible configurations. It should be pointed 
out that both approaches should be tested with detailed 
simulations. 



F. Varying collecting area 

The survey volume and the noise per pixel are both 
affected by changing the collecting area A e because 
the solid angle a survey observes is w X 2 /A e and 
oc l/Al (Eq. (1551 )). For noise-dominated experi- 

-3/2 



P N cx l/A 2 e (Eq. (33)). 
ments, SP AT /P AT oc P N /y 



"AT/ -TAT OC JT- /y/Nc OC A e 1 j V A e 1 = A e 

and, for signal-dominated experiments, d Pat /Pat oc 
oc A e . If we parametrize the scaling of the 
error on a cosmological parameter as Ap oc {Ae) 13 , we 
have —1.5 < (3 < 0.5. A caveat is FFTT which has 
fixed fl = 2-7T, so SPat / Pat oc A® (signal dominated) or 
SPat/Pat oc l/Al (noise dominated). Since FFTT is 
nearly signal dominated, (3 < for FFTT. 

We show how collecting area affects the accuracy in Ta- 
ble H (OPT model) and ED (MID model). In the OPT 
model, it appears that (3 as ™0.4 for LOFAR, \(3\ < 0.2 

for MWA, \P\ < 0.3 for SKA, and 0.1 for FFTT. 

In the MID model, it appears that ~ -1.3 for LOFAR, 

[3 0.5 for MWA, (3 0.6 for SKA, f3 0.3 for 

FFTT. These exponents are compatible with the above 
arguments. The upshot is that varying A e does not sig- 



17 



TABLE X: How cosmological constraints depend on collecting 
areas in the OPT model. Same assumptions as in TablelVlbut 
for different collecting areas A e and assume only OPT model. 
The exponent /3 tells the rule of thumb of the A^-dependence 
of marginalized errors Ap, assuming Ap oc (Ae)' 3 . 



2.0 0.020 0.24 0.40 0.048 0.80 

LOFAR 1 0.025 0.27 0.44 0.063 0.89 

0.5 0.039 0.40 0.62 0.10 1.3 

/3 -0.48 -0.37 -0.32 -0.53 -0. 35" 

2.0 0.057 0.11 0.22 0.021 0.41 

MWA 1 0.046 0.11 0.19 0.022 0.37 

0.5 0.042 0.11 0.19 0.027 0.37 

P (122 0.11 -0.18 0.0 7 

2.0 0.0027 O048 O099 0.0077 019 

SKA 1 0.0038 0.044 0.083 0.0079 0.16 

0.5 0.0043 0.043 0.076 0.0089 0.15 

f} -0.34 0.08 0.19 -0.10 0.1 7" 

2.0 0.00014 0.0031 0.0082 0.00037 0.015 

FFTT 1 0.00015 0.0032 0.0084 0.00040 0.015 

0.5 0.00017 0.0035 0.0086 0.00046 0.016 

P -0.14 -0.09 -0.03 -0.16 -0.05 



a Ag d refers to the fiducial values assumed in Table llVl and are not 
the same for different arrays. 

TABLE XI: How cosmological constraints depend on collect- 
ing areas in the MID model. Same assumptions as in Table 
|V]but for different collecting areas A e and assume only MID 
model. The exponent /3 tells the rule of thumb of the A e - 
dependence of marginalized errors Ap, assuming Ap oc (Ae) 13 . 





Ae /A^ 


aq a 


Aln(Q m h 2 ) Aln(n b h 2 ) 


An s 


A In A s 




2.0 


0.086 


0.044 


0.072 


0.15 


0.35 


LOFAR 


1 


0.13 


0.083 


0.15 


0.36 


0.80 




0.5 


0.26 


0.17 


0.35 


0.92 


2.0 




P 


-0.80 


-0.98 


-1.1 


-1.3 


-1.3 




2.0 


0.21 


0.015 


0.025 


0.073 


0.61 


MWA 


1 


0.22 


0.017 


0.029 


0.097 


0.76 




0.5 


0.26 


0.026 


0.045 


0.16 


1.3 




P 


-0.15 


-0.40 


-0.42 


-0.57 


-0.55 




2.0 


0.013 


0.0049 


0.0079 


0.0092 


0.032 


SKA 


1 


0.014 


0.0049 


0.0081 


0.012 


0.037 




0.5 


0.016 


0.0063 


0.011 


0.022 


0.053 




P 


-0.15 


-0.18 


-0.24 


-0.63 


-0.36 




2.0 


0.00036 


0.00037 


0.00061 


0.00032 


0.0012 


FFTT 


1 


0.00041 


0.00038 


0.00062 


0.00036 


0.0013 




0.5 


0.00052 


0.00041 


0.00066 


0.00046 


0.0016 




P 


-0.27 


-0.07 


-0.06 


-0.26 


-0.21 



TABLE XII: How cosmological constraints depend on obser- 
vation time in the OPT model. Same assumptions as in Ta- 
ble [V] but for different observation time to and assume only 
OPT model. The exponent e tells the rule of thumb of the in- 
dependence of marginalized errors Ap, assuming Ap oc (to)~ e - 
to is in units of 4000 hours. 





to 


AQ A 


Aln(Q m fe 2 ) 


Aln(n b h 2 ) 


Ans 


A In A s 




4.0 


0.014 


0.17 


0.28 


0.034 


0.56 


LOFAR 


1 


0.025 


0.27 


0.44 


0.063 


0.89 




0.25 


0.055 


0.56 


0.88 


0.14 


1.8 






0.49 


0.43 


0.41 


0.51 


0.42 




4.0 


0.040 


0.081 


0.16 


0.015 


0.29 


MWA 


1 


0.046 


0.11 


0.19 


0.022 


0.37 




0.25 


0.059 


0.15 


0.27 


0.038 


0.52 




€ 


0.14 


0.22 


0.19 


0.34 


0.21 




4.0 


0.0019 


0.034 


0.070 


0.0054 


0.13 


SKA 


1 


0.0038 


0.044 


0.083 


0.0079 


0.16 




0.25 


0.0060 


0.061 


0.11 


0.013 


0.21 




e 


0.41 


0.21 


0.16 


0.32 


0.17 




4.0 


0.00014 


0.0031 


0.0082 


0.00037 


0.015 


FFTT 


1 


0.00015 


0.0032 


0.0084 


0.00040 


0.015 




0.25 


0.00017 


0.0035 


0.0086 


0.00046 


0.016 




e 


0.07 


0.04 


0.02 


0.08 


0.02 



TABLE XIII: How cosmological constraints depend on ob- 
servation time in the MID model. Same assumptions as in 
Table [V] but for different observation time to and assume only 
MID model. The exponent e tells the rule of thumb of the in- 
dependence of marginalized errors Ap, assuming Ap oc (to)~ e - 
to is in units of 4000 hours. 





to 


An A 


Aln(Q m h 2 ) Aln(Q b h 2 ) 


An s 


A In A s 




4.0 


0.061 


0.031 


0.051 


0.11 


0.25 


LOFAR 


1 


0.13 


0.083 


0.15 


0.36 


0.80 




0.25 


0.36 


0.24 


0.50 


1.3 


2.9 




e 


0.64 


0.74 


0.82 


0.89 


0.88 




4.0 


0.15 


0.010 


0.017 


0.052 


0.43 


MWA 


1 


0.22 


0.017 


0.029 


0.097 


0.76 




0.25 


0.36 


0.037 


0.064 


0.23 


1.8 




e 


0.32 


0.47 


0.48 


0.54 


0.52 




4.0 


0.0089 


0.0035 


0.0056 


0.0065 


0.022 


SKA 


1 


0.014 


0.0049 


0.0081 


0.012 


0.037 




0.25 


0.023 


0.0090 


0.015 


0.031 


0.075 




e 


0.34 


0.34 


0.36 


0.56 


0.44 




4.0 


0.00036 


0.00037 


0.00061 


0.00032 


0.0012 


FFTT 


1 


0.00041 


0.00038 


0.00062 


0.00036 


0.0013 




0.25 


0.00052 


0.00041 


0.00066 


0.00046 


0.0016 




€ 


0.13 


0.04 


0.03 


0.13 


0.10 



A e /Ag d a AQ A Aln(n m fe 2 ) Aln(tt b h 2 ) An s A In A. 



nificantly affect parameter constraints. 

G. Varying observation time and system 
temperature 

The detector noise is affected by changing the obser- 
vation time and system temperature. From Eq. (|33jl . the 
noise P N oc T^Jt Q . Therefore, for noise dominated ex- 
periments, 5Pat/Pat oc P N /\fW c oc T S y S /t , and for 
signal dominated experiments, 6 Pat /Pat oc \/\/W c oc 
(T S y S /to)°. Assuming that errors in cosmological param- 



eter Ap oc (T^ys/toY, we have < e < 1. 

Since T^ ys and t$ 1 shares the same exponent, we eval- 
uate the e by varying only to in Table IXIll (OPT model) 
and lXIIll (MID model). It appears that in average e ~ 0.5 
for LOFAR, e - 0.3 for MWA, e - 0.3 for SKA, e < 0.1 
for FFTT in the OPT model, and e ~ 0.8 for LOFAR, 
e - 0.5 for MWA, e - 0.4 for SKA, e < 0.1 for FFTT 
in the MID model. These exponents are compatible with 
the expected < e < 1 from the above argument. The 
upshot is that the order unity changes in T sys and to 
play a marginal role in the accuracy for future signal- 
dominated experiments. 



18 



H. Varying foreground cutoff scale fc m i n 

Finally, we test how accuracy is affected by varying 
fcmin above which foregrounds can be cleaned from the 
signal. One expect that the constraints tend to approach 
asymptotic values at small enough fc m i n . However, the 
most effectively constrained modes are at small k (k ~ 
0.1 Mpc -1 ) for noise dominated experiments, while the > 
contributions from larger k modes are more important for 
cosmic variance limited experiments. This means that 
fc m in affects the noise dominated experiments most. Left 
panel of Figure [6] illustrates this by plotting cosmological 
constraints as a function of the relative minimum cutoff 
K m ; n = fc m i n x y(z)B{z) /2tt which is a constant scale 



alone in the OPT model, the optimal array configuration 
can affect accuracies up to a factor 3 (Figure [7]) , redshift 
ranges affect it by up to a factor of 5 (Table IVIIj) , and 
residual foregrounds affect it by up to a factor of 10 (Fig- 
ure [6l left panel) . In summary, the assumptions can be 
crudely ordered by importance as ionization power mod- 
eling 3> foregrounds ~ redshift ranges ~ array layout 



to 



non- Gaussianity. 



B. Outlook 



factor for all z-bins by definition. The slopes at K m in = 1 
are rather large for MWA (varying from K m j n = 0.5 to 2, 
the error in n s varies from 0.032 to 0.39, about 10 times 
larger). For a signal dominated experiment like SKA, 
the constraints can be off by a factor of 3, or FFTT by a 
factor of 1.6. This suggests that in general fc m i„ is among 
top factors to affect cosmological constraints. 



IV. CONCLUSION & OUTLOOK 



A. Which assumptions matter most? 



1.5 x 2.3 x 



6.4 x 
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OPT 
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MID 
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1.7 x 2.1 x 



3 x 
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All OPT All 
OPT MID 



PESS 



All 

j^j S (Planck+FFTT) 



FIG. 9: Cartoon showing how cosmological parameter mea- 
surement accuracy depends on various assumptions. The 
cases labeled merely "PESS" or "OPT" have the PESS/OPT 
ionization power spectrum modeling with MID assumptions 
for everything else. 

In Section IIII1 we have quantified how cosmological 
parameter measurement accuracy depends on assump- 
tions about ionization power modeling, reionization his- 
tory, redshift range, experimental specifications such as 
the array configuration, and astrophysical foregrounds. 
We now return to the overarching question from Sec- 
tion [I] that motivated our study: among these assump- 
tions, which make the most and least difference? 

To quantify this, we consider two of the parameters 
for which 21cm tomography has the most potential for 
improving on Planck CMB constraints based on our es- 
timates: fik and m u . Figure M shows Af2k based on 
data from Planck plus SKA as well as Am„ from Planck 
plus FFTT. Varying the ionization power modeling from 
PESS to OPT models improves the constraints on these 
two parameters by a factor of 6-15. From 21cm data 



We have investigated how the measurement of cosmo- 
logical parameters from 21 cm tomography depends on 
various assumptions. We have found that the assump- 
tions about how well the reionization process can be 
modeled are the most important, followed by the assump- 
tions pertaining to array layout, IGM evolution, and fore- 
ground removal. 

Our results motivate further theoretical and experi- 
mental work. On the theoretical side, it will be valu- 
able to develop improved EoR data analysis techniques. 
The OPT approach is restricted to when neutral fraction 
fluctuations are not important, which is not an accurate 
approximation during the EOR. On the other hand, al- 
though the PESS approach is in principle insensitive to 
our poor understanding of reionization by marginalizing 
over it, in practice this approach destroys too large a frac- 
tion of the cosmological information to be useful. Hope- 
fully more detailed EoR simulations will enable our MID 
approach to be further improved into a phenomenological 
parametrization of our ignorance that is robust enough 
to be reliable, yet minimizes the loss of cosmological in- 
formation. 3 

On the experimental side, there are numerous compli- 
cations that are beyond the scope of this paper, but that 
are important enough to deserve detailed investigation in 
future work. To what extent can radio-frequency inter- 
ference be mitigated, and to what extent does it degrade 
cosmological parameter accuracy? This is particularly 
important for instruments in densely populated parts of 
the world, such as LOFAR. To what extent is the sub- 
traction of the foreground point sources hampered by the 
complicated off-center frequency scaling of the synthe- 
sized beam? To what extent does the dramatic variation 
of the synchrotron brightness temperature across the sky 
affect our results and optimal array design? Performing 
a realistic end-to-end simulation of possible experiments 
(from sky signal to volts and back) should be able to 
settle all of these issues. 

These are difficult questions, but worthwhile because 
the potential for probing fundamental physics with 21 
cm tomography is impressive: a future square kilometer 



3 It is also possible to constrain cosmological parameters using 
lensing of 21cm fluctuations [iij l47l lil. |43| . 
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array optimized for 21 cm tomography could improve the 
sensitivity of the Planck CMB satellite to spatial curva- 
ture and neutrino masses by up to two orders of magni- 
tude, to AQ k « 0.0002 and Am, w 0.007 eV, and detect 
at 4<7 the running of the spectral index predicted by the 
simplest inflation models. 
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APPENDIX A: x 2 GOODNESS OF FIT IN THE 
MID MODEL 

In this appendix, we elucidate some issues in sepa- 
rating cosmological information from astrophysics in the 
MID model, and give the x 2 goodness-of-fit test. 

The parametrization of ionization power spectra is 
based on the assumption that these power spectra are 
smooth functions of k, and therefore can be parametrized 
with as many parameters as necessary to fit the data at 
some accuracy. However, the separation of cosmology 
from astrophysics implicitly depends on another assump- 
tion that the shapes of ionization power spectra are dis- 
tinguishable from that of matter power spectrum, since 
one can only measure the total 21cm power spectrum. 
Albeit sometimes the shape may be similar at small k 
(see the plateaus in the ratios of power spectra in Figure 
[2]) , the slope and amplitude of ionization power spectrum 
at the fall-off region can in principle distinguish nuisance 
functions from the matter power spectrum, determine the 
overall amplitude, and in return use the data at small k 
to further constrain the nuisance parameters that corre- 
spond to the amplitudes. 

An avalanche of data from upcoming 21cm experi- 
ments can make it possible to justify the MID model with 



some parametrization of ionization power spectra. There 
are standard statistical methods for testing whether the 
parametrization is successful. We now give a compact 
description of the y 2 goodness-of-fit test, and refer inter- 
ested readers to [4l| for a useful review on the statistics. 
Note that we did not implement the \ 2 test in this paper 
since this would need observational data. The description 
of x 2 test below is intended to complement the discus- 
sions of the MID model in the main part of this paper. 
We want to test the hypothesis Hq that the parametriza- 
tion with fitting parameter values is an accurate account 
of the ionization power spectra. The parameter vector to 
be fitted is 6 = (A, (i = 1, . . . ,Np),/3 a (a = 1, ... ,n ion )), 
where N p and n lon are the number of cosmological and 
ionization parameters, respectively. The observed data 
vector is y = (yi, . . . , yjsr) where = PatO^,) at each 
pixel ki labeled by i = 1, ...,N, where N is the to- 
tal number of pixels. Assuming the Gaussian statis- 
tic in the measurements, the corresponding vector F 
for the expected value is F(kj;0) = (^ss — 2^x8 + 
-I- 2(^ss — ^xs)(J- 2 + £^88 M 4 ) an d the variance is 
of = (<5P A T(k l )) 2 = ^[P A T(k,) + P N (k l± )} 2 . We can 
now compute \ 2 '- 

X 2 (e) = ( y -F(e)) T C- 1 (y-F(6)), (Al) 

where C is the covariance matrix. If each measurement yi 
is independent, then C becomes diagonal with Cu = of. 
Then Eq. (|Aip is simplified to be 

*m-± *- (A2> 

We can define the p-value as the probability, under the 
assumption of the hypothesis Hq, of obtaining data at 
least as incompatible with Hq as the data actually ob- 
served. So 

/>oo 

P= / f(z;n d )dz, (A3) 
Jx 2 (o) 

where f(z; nj) is the \ 2 probability density function 
(p.d.f.) with rid degrees of freedom rid = N — (N p + ni on ). 
Values of the \ 2 P-d.f. can be obtained from the CERN- 
LIB routine PROB [42]. To set the criterion, a fit is good 
if p > 0.95, i.e. the real data fit the parametrization bet- 
ter than the 95% confidence level. 
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